Content-Based Table Retrieval for Web Queries

نویسندگان

  • Zhao Yan
  • Duyu Tang
  • Nan Duan
  • Jun-Wei Bao
  • Yuanhua Lv
  • Ming Zhou
  • Zhoujun Li
چکیده

Understanding the connections between unstructured text and semi-structured table is an important yet neglected problem in natural language processing. In this work, we focus on content-based table retrieval. Given a query, the task is to find the most relevant table from a collection of tables. Further progress towards improving this area requires powerful models of semantic matching and richer training and evaluation resources. To remedy this, we present a ranking based approach, and implement both carefully designed features and neural network architectures to measure the relevance between a query and the content of a table. Furthermore, we release an open-domain dataset that includes 21,113 web queries for 273,816 tables. We conduct comprehensive experiments on both real world and synthetic datasets. Results verify the effectiveness of our approach and present the challenges for this task.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Investigating the Impact of Authors’ Rank in Bibliographic Networks on Expertise Retrieval

Background and Aim: this research investigates the impact of authors’ rank in Bibliographic networks on document-centered model of Expertise Retrieval. Its purpose is to find out what kind of authors’ ranking in bibliographic networks can improve the performance of document-centered model.   Methodology: Current research is an experimental one. To operationalize research goals, a new test colle...

متن کامل

An analysis of failed queries for web image retrieval

This paper examines a large number of failed queries submitted to a web image search engine, including real users’ search terms and written requests. The results show that failed image queries have a much higher specificity than successful queries because users often employ various refined types to specify their queries. The study explores the refined types further, and finds that failed querie...

متن کامل

A Visual Ontology Query Interface for Content- Based Image Retrieval

Various querying techniques have been developed for content-based image retrieval. We propose a Visual Ontology Query Interface for querying an OWL ontology built using content-based image retrieval techniques. With the query interface, users are able to formulate various ontology queries without having to know SPARQL, an ontology query language proposed by The World Wide Web Consortium.

متن کامل

Bayesian Semantics Incorporation to Web Content for Natural Language Information Retrieval

For the present work, we endeavor with the important aspect of information retrieval of Web content using natural language queries. Currently, markup languages and formalisms do not fully provide mechanisms for effective and accurate analysis of Web content but rather provide means for describing the content in a more human-centric approach. As a result, natural language queries cannot be handl...

متن کامل

Content-Based Image Retrieval over the Web Using Query by Sketch and Relevance Feedback

This paper investigates the combined use of query by sketch and relevance feedback as techniques to ease user interaction and improve retrieval effectiveness in content-based image retrieval over the World Wide Web. To substantiate our ideas we implemented DrawSearch, a prototype image retrieval by content system that uses color, shape and texture to index and retrieve images. The system avails...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:
  • CoRR

دوره abs/1706.02427  شماره 

صفحات  -

تاریخ انتشار 2017